Adjusting dysarthric speech signals to be more intelligible

Author

  • Frank Rudzicz
Abstract

This paper presents a system that transforms the speech signals of speakers with physical speech disabilities into a more intelligible form that can be more easily understood by listeners. These transformations are based on the correction of pronunciation errors by the removal of repeated sounds, the insertion of deleted sounds, the devoicing of unvoiced phonemes, the adjustment of the tempo of speech by phase vocoding, and the adjustment of the frequency characteristics of speech by anchor-based morphing of the spectrum. They are grounded in observations of disabled articulation, including improper glottal voicing, lessened tongue movement, and lessened energy produced by the lungs. This system is a substantial step towards full automation in speech transformation without the need for expert or clinical intervention. Among human listeners, recognition rates increased to as much as 191% of those for the original speech (from 21.6% to 41.2%) when using the module that corrects pronunciation errors. Several types of modified dysarthric speech signals are also supplied to a standard automatic speech recognition system. In that study, the proportion of words correctly recognized increased to as much as 121% of that for the original speech (from 72.7% to 87.9%), across various parameterizations of the recognizer. This represents a significant advance towards human-to-human assistive communication software and human–computer interaction. © 2012 Elsevier Ltd. All rights reserved.
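
The percentages in the abstract read most naturally as the modified-speech rate expressed relative to the original-speech rate, not as a percentage-point gain. The short check below is a sketch under that assumption; the helper name `relative_rate` is hypothetical and does not come from the paper.

```python
def relative_rate(before: float, after: float) -> float:
    """Return `after` expressed as a percentage of `before`."""
    return 100.0 * after / before


# Figures quoted in the abstract (percent of words correctly recognized).
human_before, human_after = 21.6, 41.2   # human listeners
asr_before, asr_after = 72.7, 87.9       # automatic speech recognizer

human = relative_rate(human_before, human_after)
asr = relative_rate(asr_before, asr_after)

# Absolute gains, in percentage points, for comparison.
human_gain = human_after - human_before
asr_gain = asr_after - asr_before

print(f"human listeners: {human:.1f}% of baseline (+{human_gain:.1f} points)")
print(f"ASR:             {asr:.1f}% of baseline (+{asr_gain:.1f} points)")
```

Under this reading, the rounded ratios match the 191% and 121% figures quoted in the abstract, while the absolute gains are about 19.6 and 15.2 percentage points.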

Similar resources

Comparing Humans and Automatic Speech Recognition Systems in Recognizing Dysarthric Speech

Speech is a complex process that requires control and coordination of articulation, breathing, voicing, and prosody. Dysarthria is a manifestation of an inability to control and coordinate one or more of these aspects, which results in poorly articulated and hardly intelligible speech. Hence individuals with dysarthria are rarely understood by human listeners. In this paper, we compare and eval...


Intelligibility of modifications to dysarthric speech

Dysarthria is a motor speech impairment affecting millions of people. Dysarthric speech can be far less intelligible than that of non-dysarthric speakers, causing significant communication difficulties. The goal of this work is to understand the effect that certain modifications have on the intelligibility of dysarthric speech. These modifications are designed to identify aspects of the speech ...


Acoustic transformations to improve the intelligibility of dysarthric speech

This paper describes modifications to acoustic speech signals produced by speakers with dysarthria in order to make those utterances more intelligible to typical listeners. These modifications include the correction of tempo, the adjustment of formant frequencies in sonorants, the removal of aberrant voicing, the deletion of phoneme insertion errors, and the replacement of erroneously dropped p...


Can modified casual speech reach the intelligibility of clear speech?

Clear speech is a speaking style adopted by speakers in an attempt to maximize the clarity of their speech and is proven to be more intelligible than casual speech. This work focuses on modifying casual speech to sound as intelligible as clear speech. First, we examine the role of speaking rate for intelligibility. Clear and casual speech signals are time-scale stretched, matching the average d...


Vocal tract representation in the recognition of cerebral palsied speech.

PURPOSE In this study, the authors explored articulatory information as a means of improving the recognition of dysarthric speech by machine. METHOD Data were derived chiefly from the TORGO database of dysarthric articulation (Rudzicz, Namasivayam, & Wolff, 2011) in which motions of various points in the vocal tract are measured during speech. In the 1st experiment, the authors provided a bas...


Journal:
  • Computer Speech & Language

Volume 27

Publication date: 2013